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Abstract 

In the paper "A new class of codes for Boolean masking of cryptographic computations ," Carlet, Gaborit, Kim, 
and Sole defined a new class of rate one-half binary codes called complementary information set (or CIS) codes. The 
authors then classified all CIS codes of length less than or equal to 12. CIS codes have relations to classical Coding 
Theory as they are a generalization of self-dual codes. As stated in the paper, CIS codes also have important practical 
applications as they may improve the cost of masking cryptographic algorithms against side channel attacks. In this 
paper, we give a complete classification result for length 14 CIS codes using an equivalence relation on GL(n, F2). 
We also give a new classification for all binary [16, 8, 3] and [16, 8, 4] codes. We then complete the classification 
for length 16 CIS codes and give additional classifications for optimal CIS codes of lengths 20 and 26. 

Index Terms 

CIS codes, classification, computerized search, formally self-dual codes, graph isomorphism 

I. Motivations 

A generalization of self-dual codes was recently proposed by Carlet, Gaborit, Kim, and Sole in Q. In the paper, 
a new class of codes, called complementary information set (or CIS) codes, is defined. Given an integer n, a binary 
linear code with parameters [2n, n, d] which has two disjoint information sets is a complementary information set 
code. CIS codes have a variety of connections and applications; the authors (in [5|) note the direct applications 
found in Cryptography, with relations to Boolean S-Boxes, Boolean functions, and masking lfl3ll . lfl4l . 1 15], [20|. 

Previous results on the classification of rate one-half codes date back to Pless' enumeration of self-dual codes 
in 1972 1181 . Since that time many classification results for self-dual codes have been obtained, most recently the 
doubly-even self-dual codes of length 40 13). A related problem is the classification of formally self-dual codes. 
The main results in this direction are classifications of optimal formally self-dual codes; results have been given 
in HI, J2l, Q, iflOl . Betsumiya and Harada gave a complete classification for even formally self-dual codes up to 
length 16 [I2|; Han, H. Lee, and Y. Lee gave a complete classification for odd formally self-dual codes up to length 
14 IflOl . Some general results on optimal rate one-half codes were obtained by Gulliver and Ostergard in Q. In 
the paper [5], CIS codes are classified for 2n — 2, 4, 6, 8, 10, 12. In the proceeding sections we obtain new results 
in the direction of these previous researchers. 

Notations are introduced in Section UD Graph isomorphism tools used for the classifications are described in 
Sections IHIWI In Section [V"fl we classify all [14, 7] CIS codes (i.e. the case where n — 7). In Section [VlH we give 
new results on the classification of [16, 8, 3] and [16, 8, 4] codes which yields a classification of both [16, 8, 3] and 
[16, 8, 4] CIS codes and [16, 8, 3] and [16, 8, 4] odd formally self-dual codes. In this section we develop an algorithm 
to decide whether a code is CIS. In Section [VIIH some theoretical restrictions on CIS codes with minimum weight 
2 are stated. This allows for the complete classification of length 16 CIS codes. In Section llXl we give an up-to-date 
classification of optimal CIS codes. All computations were completed using MAGMA | 4|; some computations were 
run in parallel and then the output data was compiled at the end. Generator matrices for length 14 CIS codes will 
be posted to the author's website JS). 



II. Notations 

The main notations and basic definitions concerning linear codes are adapted from 111 11 . Let F2 denote the binary 
field. Any subspace C of the vector space F!, 1 is called a linear [n, k) code. All codes we refer to are both binary 
and linear. The Hamming weight of a vector x <E F%, denoted u>i(x) is the number of nonzero coordinates of x. 
The Hamming distance between two vectors x,y £ denoted d(x, y) is the number of coordinates in x and 
y which are different. The minimum weight of a code C is an integer d, where the minimum is taken among all 
non-zero weights of C, C is then referred to as an [n, k, d] code. Two binary codes are said to be equivalent if 
there exists a permutation of coordinates mapping one code onto the other code. 
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Given two vectors in F£, v = a\ai . . . a n and u — &1&2 ■ • ■ b n , the Euclidean inner product of u with v is the 
sum u • v := a\b\ + 0262 ■ • ■ + a n b n . The dual of C is the set C := {x 6 F£ : x • v = for all v G C}. The 
number of codewords of weight w in a code C (resp. C ) is denoted by A^ (resp. A^;). C is called self-dual if 
C = C . C is called formally self-dual if = for all weights u>. 

A k by n matrix G is called a generator matrix for an [n, fc] code C if the rows of G form a basis for C. Any 
set of k columns of G which are linearly independent is called an information set for C. As stated above, the 
authors in [5 1 define a complementary information set code to be a binary linear code with parameters [2n, n, d] 
which has two disjoint information sets. 

III. A Classification Tool Using Graph Isomorphism 

The classification of binary [n, k, d] codes satisfying various properties is a classical problem; as mentioned 
above, previous work in this direction includes the classification of self-dual codes, formally self-dual codes, and 
rate one-half codes in general. Thus, an interesting problem in the area of CIS codes is the classification problem. 
One main difficulty that arises when classifying codes is the equivalence test. When comparing a small set of codes 
the equivalence test can be implemented easily (in MAGMA |4|) by performing a pairwise comparison of all codes 
in the set. However, when comparing more than a few thousand codes the test becomes rather time consuming. 
In essence this is a combinatorial problem of classifying objects up to a defined equivalence. A useful solution 
for this problem, proposed independently in 1978 by |]6], QjQ , is to generate a list of inequivalent combinatorial 
objects (codes) by producing a "canonical representative" for each equivalence class. This method is described by 
Kaski and Ostergard and it is called Orderly Generation ( flT2l pp. 120-124). There is no equivalence test in this 
method, the only criterion is set membership. 

The difficulty in applying the Orderly Generation method is finding a way to determine a "canonical represen- 
tative" for each equivalence class. As suggested in [12], a clever navigation of this difficulty is to make use of 
Brendan McKay's graph isomorphism program nauty lfl6l . Two graphs G and G 1 with vertex sets V and V' are 
said to be isomorphic if there exists a bijection <\> : V —> V such that (u, v) is an adjacent pair of vertices in G if 
and only if {4>(u), <j>{v)) is an adjacent pair of vertices in G' . Given a graph G with vertex set V and a fixed labeling 
on the vertices with the integers 1, 2, \V\, nauty can output a "canonical" labeling among all isomorphic graphs. 
In fact, if the graph is a colored graph, then nauty will give a canonical labeling which preserves the color among 
labels. In 1 17 1, Ostergard uses nauty functionality to classify binary linear codes of minimum distance greater than 
two for up to length 14. In ||2T1 . Schaathun implements a search which classifies all [36, 8, 16] linear codes using 
nauty. 

IV. A Correspondence Between Codes and Graphs 

Now we must describe how to transform a linear code to a colored graph. As per the formulations in lfT2l . ifPTl , 
ll2D . let a linear [n,k,d] code C be given. Let S be the set of minimum weight in C. If S does not generate 
C, then include all codewords in C of weight 1 higher than the maximum weight in S. Repeat the last step until 
S generates C. Fix an ordering on S so that a represents a specific element of S for i e {1,...,|5|}. Construct 
a set of |5| + n vertices labeled with the integers 1,2, \S\ + n (denote Vi the vertex with label i). Construct 
a bipartite graph in the following way. Let {ui, i?2, v\g\} be one partite set, and let the other partite set be 
{ v \S\+ii v \S\+2, ■■■,v\s\+n}- Draw an edge (vi,v\s\+j if and only if Cj has a 1 in coordinate j. Color vertices 
{vx,V2, v\s\} black. Color vertices {v\s\+i,v\s\+2i ■■■■> v \S\+n} re d- The following lemma is adapted from the 
known methods described in IPT21 . ifTTl . Il2p . 

Lemma IV. 1. A permutation ct\ of the labels on the black vertices corresponds to a permutation of the ordering 
on the codewords. A permutation «2 of the labels of the red vertices corresponds to a permutation of columns 
of codewords. As a result, applying ct\ and ct2 to a graph G {constructed from a code C'), yields a graph G' 
(corresponding to a code C' equivalent to C). 

Proof: The first claim is clear from the construction since Cj corresponds to vertex V{. The second claim 
follows from the fact that if ot2(v\s\+i) = v \s\+j' then a ^ codewords which had a 1 in column i, now have a 1 in 
column j after applying ot2- 

Since a.\ and ct2 correspond to permuting generators and columns in the code C to obtain G', then G' must 
correspond to a code C' equivalent to C. ■ 

Because of the functionality in nauty, a canonically labeled graph (with the color restriction described above) 
corresponds to a canonical form of a linear [n, k, d] code. Therefore we may apply the Orderly Generation method. 
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V. A Correspondence Between GL(n, F 2 ) and Graphs 

The general linear group of degree n, denoted GL(n,¥ q ), is the set of all n by n invertible matrices, over ¥ q , 
under matrix multiplication. Given any linear [2n,n, d] code C, it is clear that if the coordinate set {1,2, ...,n} 
forms an information set, then any generator matrix of C has the form G = [I\A], after performing Gaussian 
Elimination, where / is the n by n identity matrix and A is an n by n matrix. In (5] this is called the systematic 
form of the generator matrix for a [2n, n, d] code C. C is CIS if and only if C may be converted to systematic 
form where A G GL{n, ¥2), by Lemma IV.l of 0. Hence if the equivalence classes of GL(n,¥2) are classified, 
then the classification of CIS codes can be obtained using the ideas of Section [IV] Therefore an interesting related 
classification problem is to find all equivalence classes of GL(n,¥ 2 ) (under row and column permutations). 

Consider the following notion of equivalence on GZ-(n,F 2 ): two matrices A, B G GL(n,¥ 2 ) are equivalent, 
A ~ B, if and only if A = PiBP 2 if Pi and P 2 are two n by n permutation matrices. 

Observation V.l. ~ is an equivalence relation on GL(n, F2). 

Proof: Let A,B,C £ GL(n, ¥ 2 ). The notation Pj for an integer i denotes a permutation matrix. Reflexivity is 
clear since if P e is the identity permutation, then A = P e AP e . Symmetry holds since the inverse of a permutation 
matrix is a permutation matrix: A = P1BP2 implies B = P^ 1 AP 2 ~ 1 . Transitivity holds since the product of two 
permutation matrices is a permutation matrix: A = P\BP 2 and B = P3CP4 implies A = (PiPg)G(P4P2). ■ 

Observation V.2. If A,B 6 GL(n, ¥2) are such that A ~ B, then the CIS codes with systematic generator 
matrices [I\A] and [I\B] are equivalent. 

Proof: By definition, A = P1BP2 where Pi and P2 are two n by n permutation matrices. Let B' = PP2. 
The code generated by Pi[/|P'] is equivalent to the code generated by [I\B] by permuting the columns of B and 
the rows of the entire generator matrix. However, Pi[/|P'] = [P1KP1PP2)] — [Pi\A] since P\I = P\. Finally to 
obtain \I\A\ from [Pi|A] simply permute the columns of Pi by applying Pj - . ■ 

Remark V.3. The converse of Observation IV.2I is not true in general. The smallest counterexample is for n = 3. 
Consider the following matrices from GL(3,¥2) 





' 111 " 




" 110 " 


A = 


Oil 


,B = 


Oil 




001 




001 



Note that A is not equivalent to B since A has a row of weight 3 and B does not. However, it is clear to see 
that the code generated by [I\A] is equivalent to the code generated by [I\B] by permuting the second and fifth 
columns and performing row elimination. 

To use nauty, we now describe how to transform an element of GL(n, ¥2) to a colored bipartite graph. Similar 
to the method of Section HVl let A G GL(n, F 2 ). Construct a set of 2n vertices labeled with the integers 1, 2, 2n 
(denote Vi the vertex with label i). Construct a bipartite graph in the following way. Let {vi, V2, v n } be one 
partite set, and let the other partite set be {w„+i, v n +2, V2n}- Draw an edge (vi, v n +j if and only if row i has a 
1 in column j. Color vertices {vx, V2, v n } black. Color vertices {v n +i, v n +2, «2n} red. The following lemma 
is adapted from the known combinatorial formulations in IPT21 . 

Lemma V.4. A permutation a row (resp. a co i) of the labels on the black (resp. red) vertices corresponds to 
a permutation of rows (resp. columns). As a result, applying a row and a co i to a graph G (constructed from 
A G GL(n,F 2 )J, yields a graph G' (corresponding to an equivalent matrix A' G GL(n, F2)). 

Proof: The first claim follows from the construction since a row corresponds to a vertex in {wi,w 2 , ...,v n } 
and a column position corresponds to a vertex in {u„+i, v n+ 2, «2n}- 

Since a row and a co i correspond to permuting rows and columns in the matrix A to obtain G', then G' must 
correspond to a matrix A' equivalent to A. ■ 

VI. [14,7] CIS Codes 

For length 14 an optimal CIS code is mentioned in |5|; this code is self-dual with parameters [14,7,4]. In 
order to apply the theories developed in the previous section we need a construction method for the elements of 
GL(n, ¥2). Our aim in this section is to first classify elements (up to equivalence) in G£(n, F2) for n < 7, then 
we use these elements to classify all CIS codes of length 14. 
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We first discuss how to obtain matrices in GL(n, F2) using inequivalent matrices from GL(n — 1,F2). The 
following two lemmas are adapted from Lemma VI. 3 and Proposition VI. 4 of 0. 

Lemma VI. 1. Any matrix A G GL(n, ¥2) has a submatrix A 1 G GL(n — 1,F2). 

Proof: Let <2j be the ith column of A and let r*j be the ith row of A where 1 < i < n. Delete a\ from A to 
obtain an n by n — 1 matrix A\. Let r[ be the ith row of A\. Since A\ has rank n — 1, there exists a j such that 
{ r 'i '■ i j} 316 linearly independent and = /Jcjr^ for uniquely determined c;. Therefore by deleting from 

Ai we obtain an n — 1 by n — 1 matrix A' having rank n — 1. ■ 

Lemma VI.2. For any matrix A 1 G GL{n — 1, F2), a matrix A G GL(n — 1, F2) 'way »e obtained by the following: 
For any x, y G Fj -1 , _/w c := xA^ 1 and z := [1] + cy T , f/ien 




Proof: Since the rows of A' are linearly independent a; must be a linear combination of the rows of A', which 
implies there exists a c G F 2 l_1 such that cA = x. Solving for c we obtain c = xA^ 1 . To ensure that the top row 
of A is linearly independent from the other rows the value of z must be such that c[j/ T /l'] ^ [z x]. Hence cy T ^ z, 
and as the values are binary this is equivalent to cy T + [1] = z. ■ 
By applying this theory recursively to all representatives from equivalence classes of GL(n — 1,F 2 ) along with 
the canonical selection method in Section[V]we may obtain all equivalence class representatives in GL(n, ¥2). For 
n = 1, 2, 7 we have obtained the number of equivalence classes given in Table U 

TABLE I 

Number of Equivalence Classes in GL(n, F 2 ) Under Row & Column Permutations 



n — 


1 


2 


3 


4 


5 


6 


7 


Total 


1 


2 


7 


51 


885 


44,206 


6,843,555 



TABLE II 

Classification of Length 14 CIS codes 





Total CIS 


SD 


Only FSD 


Not SD or FSD 


d= 2 


62,015 


3 


4,407 


57,605 


d= 3 


22,561 





2,160 


20,401 


d= 4 


1,476 


1 


121 


1,354 


Total 


86,052 


4 


6,688 


79,360 



For each representative A from equivalence classes of GL(n, ¥2), appending the n by n identity matrix /, [I\A] 
is a generator matrix for a CIS code. By applying the method introduced in Section [IV] we can then obtain a set 
of all inequivalent CIS codes of length 2n. Hence we obtain the following classification theorem. 

Theorem VI.3. There are exactly 86,052 [14, 7] CIS codes. 

Additional information for the [14, 7] CIS codes is listed in Table Ull the rows give the possible minimum distances 
and the columns tell how many are self-dual, formally self-dual but not self-dual, and neither. 

VII. The [16,8,3] and [16,8,4] Codes 

For length 16 an optimal CIS code is mentioned in |3); this code with parameters [16,8,5] was shown to be 
unique by Betsumiya and Harada and in fact this code is formally self-dual (TJ. A classification for length 16 
CIS codes would be interesting, but since the number of equivalence classes of GL(7,F2) is very large, it is not 
feasible to determine the classes of GL(8,F2). Hence the construction from the previous section can not be used. 
Therefore we consider another method for determining the [16,8,3] and [16,8,4] CIS codes. The method we use 
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is to generate all binary linear [16,8,3] and [16,8,4] codes and then determine which ones are CIS. To ease the 
computation we make use of the well known fact that any [n, k] odd code contains a unique [n, k — 1] subcode 
generated by all even codewords. Also, it is clear that any [16, 7] subcode of any [16, 8] even code is even. Therefore 
we first classify all [16, 7, > 4] even codes. We give the following lemma based on the theory of shortening codes 
in ifTTl to justify our method for finding the [16, 7, > 4] even codes. 

Lemma VII.l. If C is a binary [n, k] code with generator matrix in standard form G, then shortening C on the 
first column yields an [n, k — 1] code. 

Proof: Since G is in standard form the only row of the generator matrix with a 1 in the first column is the 
first row. Therefore, shortening on the first column yields an [n, k — 1] code. ■ 
Applying this lemma recursively to any [n, k, d) code, a nested chain of subcodes is obtained, the smallest 
subcode having parameters [n — k + 1, 1, > d}. Therefore, any [16, 7, > 4] code has a nested chain of subcodes 
("subcode" meaning by adding a zero column it is a subcode): 

[16, 7, > 4] D [15, 6, > 4] D [14, 5, > 4] D [13, 4, > 4] 

D [12, 3, > 4] D [11, 2, > 4] D [10, 1, > 4] 

If we have a list of all inequivalent [n' , k' , > 4] codes L we construct all [n' + 1, k' + 1, > 4] supercodes by adding 
a zero column onto each code C in L and then increasing the dimension by adding vectors from FJ? +1 /C. This 
method is somewhat opposite from the method described in |jTT| which instead adds columns to the parity check 
matrix. We apply the method recursively and keep only "canonical" representatives as in Section |IV] to obtain a 
classification of 29,243 total inequivalent [16, 7, > 4] even codes. 29,240 of these codes are [16, 7, 4] and 3 of them 
are [16, 7, 6] (the [16, 7, 6] codes were previously classified by Simonis in ||22~1 ). 

Our goal is to classify all [16,8,3] and [16,8,4] codes, so the next step is for each [16,7, > 4] even code C 
we add all possible vectors x from F^/C to form codes C+ < x >. We keep a list of all [16, 8, > 3] codes 
generated in this way. To determine which codes are inequivalent we keep only the "canonical" representatives as 
in Section JV] Our conclusion from this search is the following theorem. 

Theorem VII.2. There are exactly 2,914,299 binary [16, 8, 3] codes and there are exactly 271,783 binary [16, 8, 4] 
codes. 

In the Tables [Til] and [IV] we have the totals for how many of these codes are self-dual, only even formally- 
self-dual, only odd formally self-dual, and neither self-dual nor formally self-dual; there we note the previously 
classified self-dual codes [18| and even formally self-dual codes J2j. We also include a column which states how 
many have d 1 - ^ 1, which means there are no zero columns in the generator matrix. 

Since we have a list of all inequivalent [16,8,3] and [16,8,4] codes we may then pursue the main goal of 
classifying the ones which are CIS. To determine if a code is CIS we use the following algorithm. 

CIS Determination Algorithm: An algorithm to determine if a given code is CIS. 

(i) Input: Begin with a binary [2n, n] code C. 

(ii) Output: An answer of "Yes" if C is CIS and "No" if not. 

a) Fix a generator matrix G of C. Initialize an empty set to hold used integer subsets U := {}. 

b) Fix the first column of G by initializing / := {1} (/ holds the columns of an information set we are 
building). 

c) Choose n — 1 integers {ii, i n -i\ from the set {2, 3, . . . , In — 1, 2n} such that {t\, i n _i} ^ U . 
Include {ii, i n ~i} as an element in U. Let J := I U {t\, 

d) If the columns of G indexed by / are not linearly independent or indexed by {1, 2, 3, . . . , In — 1, 2n}\I 
are not linearly independent, then go to (e). Otherwise, if the columns of G indexed by / are linearly 
independent and indexed by {1, 2, 3, . . . , 2n — 1, 2n} \ I are linearly independent, then output "Yes" 
and exit algorithm. 

e) If \U\ = ( 2 ™Zi)i th en output "No" and exit algorithm; otherwise, go back to (b). 

This algorithm searches through all possible information sets of size n and checks if the complement is also 
an information set. The first column may be fixed in step (b) since without loss of generality the first column 
appears in one information set of size n in any CIS code. This algorithm searches through at most ( J!Ti ) possible 
information sets; if the code is determined to be CIS the algorithm is exited early in step (d). 
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To examine a general [16, 8] code to determine it is not CIS, the algorithm will have to search ( „) = 6435 infor- 
mation sets. For a single code this takes approximately 2.129 seconds. However, since there are 2,914,299+271,783= 
3,186,082 codes to examine, applying the algorithm one code at a time would take at most approximately 78.5 
days. Instead we applied the algorithm in parallel to separate codes to decrease the execution time. By applying 
this algorithm to all [16,8,3] and [16,8,4] codes we obtain the following conclusion. 

Theorem VII.3. There are exactly 2,711,027 [16, 8, 3] CIS codes and there are exactly 267,442 [16, 8, 4] CIS codes. 

TABLE III 

Classification of [16, 8, 3] codes and [16, 8, 3] CIS codes 





Total 


d x ^ 1 


Odd FSD 


Not FSD 


All [16, 8, 3] 


2,914,299 


2,780,328 


162,423 


2617905 


CIS [16,8,3] 


2,711,027 


2,711,027 


162,406 


2,548,621 



TABLE IV 

Classification of [16, 8, 4] codes and [16, 8, 4] CIS codes 





Total 


d x ^ 1 


SD 


Only Even FSD 


Odd FSD 


Not SD or FSD 


All [16,8,4] 


271,783 


268,261 


3 1 IS | 


141 12) 


12,827 


255,290 


CIS [16, 8, 4] 


267,442 


267,442 


3 


141 


12,827 


254,471 



As a tangential result to Theorem IVII.2I we examine the number of odd formally self-dual codes. The recent 
results on odd formally self-dual codes are given in iPTOl . where odd formally self-dual codes are classified for 
lengths up to 14 and additional results are given on optimal odd formally self-dual codes. We give a classification 
of odd formally self-dual [16,8,3] and [16,8,4] codes in the following corollary to Theorem IVII.2I 

Corollary VII.4. There are exactly 162,423 odd formally self-dual [16,8,3] codes and there are exactly 12,827 
odd formally self-dual [16,8,4] codes. 

VIII. Restrictions on CIS Codes of Minimum Distance 2 

The following proposition and its corollaries give restrictions on the structure of the systematic generator matrix 
of a CIS code with minimum distance 2. This allows for some interesting theory for constructing CIS codes with 
minimum distance 2. 

Proposition VIII.l. Let C be a [2n, n, 2] CIS code with generator matrix in systematic form G = [I\A], If x is a 
weight 2 codeword of C, then x is a row of G. 

Proof: Suppose to the contrary that x is not a row of G. Let ij be the jth row of / and aj be the jth row of A, 
hence let ijGj be the jth row of G. Since a; is a codeword of C there exists a linear combination x = Y^j=i c jih a j) 
where Cj G F2. Which implies the equation 2 = wt(x) — U>t(VJ" =1 Cj(ij)) +wt(yj"_ 1 Cj(dj)). As x is not a row 
of G then at least two CjS are nonzero. However, the support of the ijS do not intersect; so if more than two CjS are 
nonzero, then the weight of x is greater than two. Therefore exactly two CjS are nonzero; let these be Cf and Cjn. 
So now wticji (ij') + Cj" (ij")) — 2, which implies by the above weight equation that wt(cj> (djt) + Cj" (%")) = 0- 
This is a contradiction since it implies the rows of A are not linearly independent. ■ 

Corollary VIII.2. If C is a CIS code with minimum weight 2, then all weight 2 codewords have disjoint support. 

Proof: By Proposition IVIII.ll the weight 2 codewords appear in the generator matrix with systematic form. If 
two weight 2 codewords do not have disjoint support, then their corresponding rows in A will not be independent. 

■ 

Corollary VIII.3. Let C be a [2n, n, 2] CIS code with generator matrix in systematic form G — [I\A\. If a weight 
2 codeword of C has support {k\, IZ2} with k\ < k% then k\ £ {1, 2, . . . , n} and k% € {n + 1, n + 2, . . . , 2n}. 
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Proof: By Proposition IVIII.ll the weight 2 codewords appear in the generator matrix with systematic form. If 
the claim is not true, then either I or A will have an all zero row which is a contradiction. ■ 
Proposition IVIII.ll and its two corollaries give some insight into the construction of CIS codes of minimum 
weight 2 and allow us to obtain the following theory which is unique to the construction of CIS codes of minimum 
weight 2. 

Proposition VIII.4. All [2n, n, 2] CIS codes (up to column permutation) can be obtained from a list of all 
inequivalent CIS codes of length 2n — 2. 

Proof: Let C be any [2n, n, 2] CIS code. Without loss of generality the first row of a systematic generator 
matrix G of C is a weight 2 vector. By Lemma VI. 3 and Propositions VI.4 and VI. 6 of [5] G has the following 
form: 



1 


00 • • • 


1 


00 • • • ' 





/ 


y T 


A 



where is the all zero column of length n — 1, y € , and [I\A] is a generator matrix in systematic form for 
a [2n — 2, n — 1] CIS code C' . It is noted in Remark VI. 8 of [5| that using the Building-up Construction on a CIS 
code C' of length 2n — 2 may produce inequivalent sets of CIS codes when a different systematic partition of C' 
is used. However, this problem does not occur in this case since all zeros appear above / and A in the generator 
matrix G; hence permuting columns of C' to obtain a different systematic form [I\A'] is the same as permuting 
the columns of G corresponding to / and A. ■ 
For any length 2n there is special CIS code that can be constructed with n weight 2 vectors by the following 
proposition. It may be noted that this code is self-dual. 

Proposition VIII.5. There is a unique CIS code of length 2n containing n codewords of weight 2. 

Proof: This code is a [2n, n, 2] CIS code. The generator in systematic form has all weight 2 rows with disjoint 
supports which follows from Proposition IVIII. ll and its corollaries. ■ 
By applying Proposition IVIII. 4l to all CIS codes of length 14 from Section [VTl all [16,8,2] CIS codes may be 
obtained. We implemented this in MAGMA to deduce the following theorem. 

Theorem VIII.6. There are exactly 4,798,598 [16,8,2] CIS codes. 

This completes the classification of length 16 CIS codes. In the style of Table HI1 we compile the information 
from Section [VIl] and the information on the unique optimal [16,8,5] CIS code (from J5)) to give Table M 

TABLE V 

Classification of Length 16 CIS codes 





Total CIS 


SD 


Only FSD 


Not SD or FSD 


d = 2 


4,798,598 


4 


150,080 


4,648,514 


d = 3 


2,711,027 





162,406 


2,548,621 


d = 4 


267,442 


3 


12,968 


254,471 


d = 5 


i (U 





i m 





Total 


7,777,068 


7 


325,455 


7,451,606 



IX. On the Classification of Optimal CIS Codes 

In Q the authors gave examples of codes with best known minimum distance which are in fact CIS for lengths 
2 through 130. The authors also give an example of a code which is optimal but not CIS. Therefore an interesting 
problem is the classification of optimal CIS codes (and the determination of optimal codes which are not CIS). 

In Section lVTl we classified the length 14 CIS codes. In [9], it was determined that there exist exactly 1535 optimal 
[14, 7, 4] codes. We reconstruct those codes applying the method described in Section IVIII and then examine the 
ones that are not CIS. In doing so we obtain the following interesting result. 

Proposition IX.l. There are exactly 59 optimal [14, 7, 4] codes which are not CIS. 47 of these codes have dual 
distance 1 and the other 12 have dual distance 2. 



8 



Proof: The 47 with dual distance 1 are not CIS by Proposition IV.5 of [5|. We determine that the remaining 
12 are not CIS by applying the CIS Determination Algorithm from Section [VTll ■ 

The 12 [14,7,4] codes listed in the proposition are optimal rate one-half codes of the smallest length with 
d = 2 which are not CIS. It is noted in Proposition IV.6 from [5|, that there exists at least one optimal code 
(with parameters [34, 17, 8]) which is not CIS. The noted [34, 17, 8] code has dual distance 1. In Proposition lIX.il 
we give a first example of optimal codes with dual distance greater than 1 that are not CIS. 

Optimal [20, 10,6] and [26, 13, 7] were determined in [9|. By applying the CIS Determination Algorithm to all 
1682 optimal [20, 10, 6] codes and all 3 optimal [26, 13, 7] codes we determined the following result. 

TABLE VI 

Classification of Optimal CIS and non-CIS Codes 



2n 


d opt |9| 


CIS 


not CIS 


Total [9| 


2 


2 


1 [5J 





1 


4 


2 


2 [5| 


1 


3 


6 


3 


1 





1 


8 


4 


1 





1 


10 


4 


4 [5| 





4 


12 


4 


41 


2 


43 


14 


4 


1476 


49 


1535 


16 


5 


1 





1 


18 


6 


1 





1 


20 


6 


1682 





1682 


22 


7 


1 





1 


24 


8 


1 





1 


26 


7 


3 





3 


28 


8 


1 |5| 





1 



Proposition IX.2. All 1682 optimal [20, 10, 6] codes are CIS and all 3 optimal [26, 13, 7] codes are CIS. 

In Table |VI] we give the results which have been obtained so far in this direction. Known optimal CIS codes 
described in [5 | are cited in the table. New results on determining which optimal codes are CIS and not CIS are 
labeled in bold. Column 2n is the length. The second column is the optimal minimum distance for any [2n, n] 
(n < 14) code determined in |9j. The columns 3 and 4 give the number of codes which are CIS and not CIS 
respectively. The last column is the total number of optimal codes which was determined in (9). 

X. Conclusion 

Many open problems in Coding Theory are concerned with the classifications of codes. Results in this direction 
have been obtained for self-dual codes, formally self-dual codes, and rate one-half codes in general. A recent 
generalization of self-dual codes, called CIS codes, was proposed in |5| and full classification results were given 
for lengths up to 12. In the present paper we complete the classification of CIS codes for length 14 and 16. We 
also give a classification of all binary [16,8,3] and [16,8,4] codes which in turn yields new classification results 
for odd formally self-dual and CIS codes with parameters [16, 8, 3] and [16, 8, 4]. In the final section, we complete 
the classification of optimal CIS codes for lengths 20 and 26. 

Acknowledgments: The author would like to thank Jon-Lark Kim for the helpful comments and valuable suggestions. 
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